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Abstract 

In the game of Matching Pennies, Alice and Bob each hold a penny, 
and at every tick of the clock they simultaneously display the head or the 
tail sides of their coins. If they both display the same side, then Alice 
wins Bob’s penny; if they display different sides, then Bob wins Alice’s 
penny. To avoid giving the opponent a chance to win, both players seem 
to have nothing else to do but to randomly play heads and tails with equal 
frequencies. However, while not losing in this game is easy, not missing 
an opportunity to win is not. Randomizing your own moves can be made 
easy. Recognizing when the opponent’s moves are not random can be 
arbitrarily hard. 

The notion of randomness is central in game theory, but it is usually 
taken for granted. The notion of outsmarting is not central in game theory, 
but it is central in the practice of gaming. We pursue the idea that these 
two notions can be usefully viewed as two sides of the same coin. 


1 Introduction 

1.1 Game of Matching Pennies 

The payoff matrix for Matching Pennies is displayed on Table [TJ For the con¬ 
venience of using the bitstring notations, we denote the heads move as 0 and 
the tails move as 1. The game is repeated, and we assume that it is played 
long enough that even the smallest strategic advantages get expressed in the 
outcome. Both players can win or lose arbitrarily large amounts of pennies. 

1.2 How not to lose Matching Pennies 

To determine her strategy, Alice might reason something like this. 

Suppose that I consistently play 1 with a frequency p G [0,1] and 
thus 0 with a frequency 1 — p. If I set p < 1, then Bob can get the 
expected payoff —p -b (I — p) = 1 — 2p > 0 by playing 1. If I set 
p > i, then Bob can get the expected payoff p — (I —p) = 2p — I > 0 
by playing 0. If I set p = 5, then Bob’s expected payoff is the same 


1 


0 1 



-1 

1 

0 

1 

-1 


1 

-1 

1 

-1 

1 


Table 1: Payoffs for Matching Pennies 


whether he plays 1 or 0: it is 1 — 2p = 2p — 1 = 0. Since Bob’s 
winnings are my losses, the best strategy for me is to set p = 
and to play 0 and 1 with equal frequencies, since that minimizes my 
expected losses. 

By the same reasoning. Bob arrives at the same conclusion, that he should set 
the frequency of playing 1 at g = ^. This is the well known Nash equilibrium 
of the game of Matching Pennies. Both players arrive to it by minimizing the 
expected losses. 

1.3 Playing Matching Pennies 

In general, a mixed strategy Nash equilibrium prescribes the frequencies for both 
players’ moves in the long run. The essential assumption is that the moves will 
be randomized. If Bob’s move is predictable with some likelihood, then Alice 
can increase her chances to win. It seems natural to imagine that the players 
randomize by tossing their coins, and displaying the random outcomes. At the 
equilibrium, the players are just passive servants of chance, since they cannot 
gain anything by deviating from it. If they are rational, all they can do is toss 
their coins. 

But suppose that Bob suddenly plays 

0101010101010101010101010101010101010101 ( 1 ) 

Will Alice predict that Bob’s next move is 0 and play 0 to win a penny? If she 
thinks probabilistically, she will probably notice that the probability of getting 
m by flipping a fair coin is 2 which is exactly the same as the chance of 
getting, e.g. 

1101000100110101001011100100000100000010 (2) 

or any other sequence that she would accept as random. If Alice’s rationality is 
based on probabilities, then she will not be able to distinguish any two strings 
of Bob’s moves, since they are all equally probable if he tosses fair coins. 

But if Bob knows or just believes that Alice’s rationality is based on prob¬ 
abilities, and that Alice will thus continue to randomize her moves in any case, 
then Bob has no reason to randomize, since playing O, or ([2]), or a string of 
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Os, or any other string, yield the same expected payoff against Alice’s random 
plays. On the other hand, if Alice believes that Bob’s rationality is based on 
probabilities, then she will also have no reason to randomize, since her payoffs 
are always the same. 

So by combining their beliefs about their probabilistic reasoning, both play¬ 
ers will become indifferent towards mixing and randomizing their moves. Their 
common knowledge that they may both stop randomizing, because they both 
know that the opponent will be unable to tell, will not change their expected 
payoffs. Indeed, if they both play non-randomly, one of them will almost surely 
win and the other will lose, but their chances are the same, and they aver¬ 
age out. However, while the expected payoffs remain unchanged, the higher 
moments will, of course, diverge wildly. 

1.4 How to win Matching Pennies if you can 

In order to exploit Bob’s deviation from the equilibrium, or to give him an 
incentive to genuinely randomize his mixed equilibrium strategy, Alice must go 
beyond probabilities, i.e. beyond just calculating the frequency of his moves. If 
she just checks whether the frequencies of 0 and 1 are she will detect that 
the string consisting Os alone is not random, but not that the string (ED is not 
random; if she checks whether the frequencies of 00, 01, 10 and 11 are she will 
detect that ED is not random, but not that the string where these four digraphs 
of bits alternate is not random; etc. By checking that each bitstring of length n 
has in the long run the frequency she will detect many non-random plays, 
but still miss most of them. E.g., the string 

0110111001011101000100110101100110111101111... (3) 

obtained by concatenating the binary notations for the sequence of natural 
numbers 0,1,2,3... will pass the bias tests for all n-grams, if taken long enough, 
yet it is, of course, easily predictable, and obviously not random. Moreover, 
Bob might, e.g., randomize all even bits, and just alternate Os and Is at the 
odd positions. To recognize such opportunities, Alice will have to check that 
every substring of the string of Bob’s past moves has unbiased frequencies of all 
n-grams. As the game goes on, Alice will thus have to keep proving that Bob’s 
play, i.e. the ever growing string of his past moves, is what von Mises called 
Kollektiv in his theory of probability [28) . Proving that something is a Kollektiv 
is known to be a problematic task, as specifying the substrings to be tested has 
led to problems that remained open for many years [md]. 

1.5 Randomness from eqnilibrinm 

Scratching the surface of the basic assumption about the players’ incentives to 
implement a mixed strategy equilibrium led us straight into the foundationals 
of probability. There is, of course, nothing very surprising about the fact that 
the concept of a mixed strategy, expressed in terms of probabilities, depends on 
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the foundations of probability. The point of the story is not so much that there 
are deep probabilistic problems lurking behind simple games. Although easily 
ignored, that fact should not be surprising. A more useful point that we would 
like to make is that, the other way around, there seem to be instructive ways 
to state the solutions of the foundational problems of probability in terms of 
games, simple and intuitive. 

In particular, we show that the usual way of defining mixed strategy equi¬ 
libria based on the notion of randomness (and hiding the subtleties and the 
complexities of this notion under the carpet) can be reversed — and that the 
notion of randomness can be defined the other way around, using mixed strategy 
equilibria. The upshot of this is not just that a complicated concept of random¬ 
ness is replaced by an intuitive game theoretic concept of not losing Matching 
Pennies at the equilibrium; the upshot is also that the effective content of both 
concepts, of randomness and of equilibrium, can be analyzed in terms of ran¬ 
domness tests, which bring both probabilistic and game theoretic concepts on 
the mathematical ground of inductive inference 0113111111 [30] — with a great 
potential for cross-polination of ideas. 

1.6 Background and related work 

We propose a simple and narrow bridge between games and probabilities. An 
extensive effort towards reconstructing the foundations of probability theory 
from a particular game, has been ongoing for many years, as reported by Shafer 
and Vovk [24] . The work presented in this paper is not only at the opposite end 
of the scale in terms of its scope and technical sophistication, but it also goes 
in a different direction, and therefore uses an essentially different model. While 
the authors of [H] aim to reconstitute the full power of the diverse probabilistic 
tools in their rich gaming model, the point here is to illustrate how the most 
basic games capture the most basic probability concepts in a completely natural 
fashion. 

The bridge between games and probabilities is built using significance testing 
and computation. Significance testing goes back to Fisher 00 and lies, of 
course, at the core of the method of statistical induction. The constructions 
sketched here are related to the computational versions of testing, developed on 
one hand in Martin-Lof’s work nnin], and on the other hand in the techniques 
of inductive learning 01113]. 

We analyze the computational content of testing. The analyses of the com¬ 
putational content of strategic reasoning go back to the earliest days of game 
theory [21] . and continue through theory of bounded rationality [23] . and on 
a broad front of algorithmic game theory [18] . The finite state machine model 
seems preferred for specifying strategies [130, since computable strategies lead 
to problems with the equilibrium constructions [261116] . In recent work, a dif¬ 
ferent family of problems, arising from the cost of strategic computations has 
been analyzed, including the cost of randomization 00- This led the players 
to not just lose the incentive to randomize, as in the little story above, but 
to prefer determinism. Although we are here also looking at the problem of 
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deviating from the equilibrium into non-randomness, we are concerned with a 
completely different question: How should the opponent recognize and exploit 
this deviation? The present work seems to deviate from all previous compu¬ 
tational approaches to gaming in one essential aspect: we are not analyzing 
the computations that the players perform to construct or implement their own 
strategies, or the equilibrium, but the computations that they perform to test 
the opponents’ strategies. This leads into a completely different realm of com¬ 
putability, that emerges from a different aspect of gaming. While the analysis 
goes through for most models of computation, abstracted away behind a family 
of computations £ (as explained in Sec 12.31) . it is perhaps worth pointing out 
that it leads to different notions of randomness, and that stronger notions of 
computation lead to stronger notions of randomness. 

1.7 Outline of the paper 

In Sec. [2] we spell out the preliminaries and some notations used in the paper. 
In Sec.[3]we motivate and explain the simplest case of randomness testing, with 
respect to the uniform distributions, and describe its application in gaming. 
In Sec. |4] we construct the universal test that can be constructed when all 
computable partial functions can be used. Sec. [5] derives as a corrolary the 
characterization of random strings as the equilibrium plays. In Sec.[S]we describe 
how to construct randomness tests for arbitrary programmable distributions. 
Sec. [3 closes the paper with some final comments. 

2 Notations and preliminaries 

2.1 Monoid of plays 

In the games considered in this paper, the set of moves is always 2 = {0,1}. We 
sometimes call 0 heads and 1 tails. A play is a finite string (or list, or vector) 
of moves x = X 1 X 2 X 3 ■ ■ ■ Xm, or y = yi?/ 22/3 ■ ■ - yn played in a match of a game. 
The set of all bitstrings, used to represent plays, is denoted by 2*. The empty 
bitstring is (), and the concatenation of bitstrings \s x v. y = xi ■ ■ ■ Xmyi ■ ■ - yn 
constitute the monoid (2*,::, ()), which is freely generated by 2. The monoid 
structure induces the prefix ordering 

3.Z. X :: z = y (4) 

and the length measure £ : 2* —> N, which is the unique homomorphism from 
the free monoid over two generators to the free monoid over one generator. The 
fact that it is a homomorphism means that 

£{) = 0 and £ (x :: y) = £ (x) + £ (y) 

We shall also need a bijective pairing : 2* x 2* —> 2* with the projections 

— (0), —( 1 ) : 2* —2*, which means that together they satisfy 


5 


Using the fact that a free monoid is also cofree, a bijective pairing can be derived 
from any two disjoint injections 2* 2*. We shall use 

{x,y) = XiXiX2X2---XmXm^lyiy2---yn (5) 

where x = xiX 2 ■ ■ ■ Xm and y = yiy 2 ■ ■ ■ y-n- The length induces the shift homo¬ 
morphism 

^((uy)) = 2 £{x)+£{y )+2 (6) 

2.2 Sets and functions 

|X| denotes the number of elements of the set X. A functions / : A —>■ U is 
assumed to be total, whereas a partial function is written duS h : X —r Y. We 
write h{x) f to say that the partial function h is defined on the input x, and 
h{x) t or h{x) =t to say that h is undefined on x. Whenever we write h{x) in 
a formula or a definition, we tacitly assume h(x) j,, unless specified otherwise. 

2.3 Programmable functions 

We say that / : 2* ^ 2* is C-programmable, or that it is an C-function when it is 
specified using a programming language C. The intuitions from reader’s favorite 
programming language, practical or theoretical, should do. For a theoretical 
example, one could take £ to be the language of finite state machines. A program 
could then be either a list of transitions of a Moore or Mealy machine, or a 
corresponding regular expression mM- The graphs of programmable functions 
would be regular languages. A larger family or programmable functions would 
be obtained from a Turing complete programming language, like Python or 
Java, or the Turing machines themselves. In the latter case, a program could 
again be a list of the transitions of the machine. The intermediary solution 
might be to use computations specified in the language of monoidal computers 
[T9l[20]. In any case, all computable partial functions would be programmable, 
together with their recursively enumerable graphs. 

In all cases, we assume that a program corresponding to a given £-function 
f : X ^ Y is effectively given as a bitstring pf G 2*. When the language £ is 
Turing complete, then for all types x and Y there is an £-function v : 2* x X ^ 
Y, the universal evaluator, such that for every £-function f : X —r Y holds 
f{x) = v(pf,x) for the suitable program p/ and all inputs x G X. 

3 Uniform randomness testing 

We focus on Alice’s task to detect patterns of non-randomness in Bob’s play, 
which she could exploit to predict his moves. Bob is assumed to be doing the 
same, observing Alice’s play and trying to detect some patterns. But what is a 
pattern? And what does it mean to detect it? 
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Intuitively, an object has a pattern if it can be described succinctly, i.e. com¬ 
pressed. E.g. the string (HD can be compressed to (01)^° in a regular notation, 
or to 

for (i=0; i <20; i-f-b) { print 01 } 

in a programming language like Java. The program to extend (HD infinitely 
would be (01)* or 

for (;;) { print 01 } 

and the program to output ([3]) would be 

for (i=0;;i-|—f) { print i } 

On the other hand, a program to output the string ([5]), without a detectable 
pattern, would have to spell it out in full length: 

print 110100010011010100101110010000010000001001 

The idea that randomness can be defined as incompressibility goes back to 
Kolmogorov m, and further back to the scholastic logical principle known 
as Occam’s Razor, which established the priority of succinct descriptions as 
inductive hypotheses, as explained by Solomonoff [35] . 

3.1 Detecting regularity, testing randomness 

Definition 3.1 Let C he a family of programmable (partial) functions. An C- 
detector is an C-function h : 2* ^ “2* such that 


h{x) =y e(x) < e{y) 


(7) 


A detector h is predictive if 



( 8 ) 


where C is the prefix ordering O- 


A bitstring y is said to be h-regular at the level to G N z/ 


3x. h{x) = y A £(x) + m < £{y) 


(9) 


The h-regular bitstrings at each level form the h-regularity sets 


77m = {y e 2"|3x. /i(f) = y At(T)-bTO < f(y)} 


( 10 ) 


oo 



( 11 ) 


n—1 


which all together form the tower 


Hi2 H22H3 2- --2 Hm 2 ■■■ 


( 12 ) 


This tower is what we call the h-test. 
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Remark. Note that a bitstring of length n can only be regular at the level m 
if TO < n. The /i-regularity sets for m > n are empty. 

Proposition 3.2 The h-regularity sets decrease exponentially, with 

|i?”l < 2"— (13) 

Proof. By ([T0|) . for every y G there is x such that h{x) = y and £{x)+m < 
£{y), and thus £{x) <n — m, since y G 2". All y G are thus enumerated by 
the strings x of lengths at most n — m. Hence (HI. □ 

Remark. Proposition 13.21 says that the chance that an observation y is h- 
regular at the level to decreases exponentially in to. Since this is true for all 
detectors, the implication is that most bitstrings are irregular: most detectors 
are eventually rejected, and most bitstrings are accepted as random. This is a 
formal expression of Laplace’s observation that regular objects constitute a null 
set [13]. 

Definition 3.3 The /i-regularity degree Chiy) is the highest h-regularity level 
that the bitstring y achieves, i.e. 

ah{y) = max{TO < £{y) \ y G Hm} 

The h-regularity degree is thus a function ah ■ 2* —> N. 

3.2 Alice tests Bob 

Alice uses detectors as tools for hypothesis testing, e.g. in the style of jUH]. She 
tests the hypothesis that Bob’s play y is random. She will reject the hypothesis if 
she finds a detector h such that y is /i-regular at a sufficiently high level. So Alice 
tests Bob by going down the h-test tower Hi H 2 ■ ■ ■, and checking how far 
is it true that y G Hm- This will cease to be true at the level ah{y) < £{y)- If 
the number ah{y) is high, then y is probably regular, and the hypothesis that 
it is random should be rejected. 

To decide when to reject the randomness hypothesis, Alice first of all stip¬ 
ulates a significance threshold M, and considers the outcome of testing y by 
h if ah{y) > M. Since this significance threshold is the negative logarithm of 
the usual statistical thresholds, which are usually 1% or 5%, then Alice should 
presumably take M to be a number between 4 and 7. 

If she finds a test that detects a significant degree of regularity in y, then 
Alice attempts to exploit the detected pattern to gain advantage in the game. 
If the detector h is predictive, then Alice extends the succinct Ii-description x 
of y into a succinct descripton z Zi x, from which she extrapolates a prediction 
h{^ □ y of Bob’s future play, and plays /i(z) herself, since she wins whenever 
they play the same. 

Note that the definition of predictive detectors implies that ahih{z)) > 
ah (y), which means that Alice’s choice of possible predictions narrows as the 
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observed bitstrings are getting longer and the regularity levels higher: more sig¬ 
nificant testing outcomes provide more information about the tested bitstring, 
and better fitting predictions. The usual problems of statistical inference enter 
scene. However, statistical tests derived from detectors, as ^-programmable 
functions, also have some very special features, which we consider next. 


4 Universal detector 


If the C is the family of all computable partial functions, then it contains a 
universal evaluator, as explained in Sec. [2] This means that the programs for 
all detectors can be enumerated. By dovetailig, i.e. executing increasing parts of 
increasing detectors, we can construct a universal detector, which will eventually 
detect any pattern that any computable detector can detect. The test derived 
from the universal detector can be viewed as the universal randomness test. The 
idea and the first construction goes back to Per Martin-L6f Ha- 

Definition 4.1 An C-detector u : 2* ^ 2* is universal if u-testing can detect 
as significant, up to a suitably modified threshold, the significant outcomes of 
h-testing for any C-detector h. More precisely, for every C-detector h : —r “2* 

there is a constant Ch such that for every bitstring x holds 

cThiy) < Ch-\-(Tu{y) (14) 

Proposition 4.2 There is a universal C-detector if C is the family of all com¬ 
putable partial functions. 


Proof. Let u : 2* x 2* ^ 2* be the universal evaluator. This means that for 
every computable function / : 2* ^ 2* there is a program pf such that 


Define 


/(f) = v{pf,x) 


u{x) 


v{xo,xi) if f(f) < f (u (fo,fi)) 
t otherwise 


Then u : 2* ^ 2* is a detector by Def. 13.11 For any detector h : 2* ^ 2* and 
any bitstring f then holds 


h{x) = u{{ph,x)) 

where ph is a program encoding h. Using the pairing 
Then for Ch = 2i{ph) -\-2 holds 

Ch £(x) -hm < £(u {(ph, f))) = £ (^(f)) 

which means that u-regularity at the level m implies /i-regularity at the level 
m + Ch, i.e. C Hence (HH). □ 
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5 Matching Pennies randomness 

The notion of randomness as incompressibility, as formalized by Kolmogorov 
|12j and developed in algorithmic information theory |14) . has been justified 
by Martin-Ldf’s proof that incompressible strings are just those that pass all 
randomness tests [muTnii]. But we have seen that randomness tests are also 
a part of playing Matching Pennies. The players stay at the equilibrium only as 
long as their plays pass each other’s tests. Whenever a test produces a signihcant 
outcome, the randomness hypothesis is rejected, and the players depart from the 
equilibrium, whether the detected pattern was a real consequence of someone’s 
earlier deviation from the equilibrium, or whether the test overfitted a pattern 
onto an actually random string. The equilibrium persists only if both player’s 
plays pass both players’ tests. 

Corollary 5.1 A hitstring is uniformly random (in the sense of Kolmogorov 
UMUM) if and only if it can occur as a play of the equilibrium strategy in the 
game of Matching Pennies. 

Comment. The point of this corollary, and of this paper is that randomness 
tests can be viewed as a part of a mixed strategy equilibrium, and that they 
are a necessary part. Although very minor, this observation does seem worth 
attention. Not just because taking randomness for granted results in swiping 
under the carpet many interesting and important problems with the equilibrium 
constructions. More importantly, taking the randomness testing for granted 
hides from sight the whole wide area of players’ strategic analyses of each other’s 
plays, which is where the essence of real gaming as competition is played out. If 
Alice’s play pass Bob’s tests, but Alice’s tests detect the regularity behind Bob’s 
play, then Alice will win by outsmarting Bob. Randomness and outsmarting are 
two sides of the same coin. That seems worth attention. 


6 General randomness testing 

Definition 6.1 A string distribution is an L-programmabl^ function P : 2* — ; 
[0,1] such that 

P() = l Pix) = Pix :: 0) + Pix :: 1) 

Definition 6.2 Given a string distribution P : 2* —> [0,1], a P-detector with 
respect to C is an C-function hp : “2,* ^ 2* such that 

hp{x) =y => £{x) < £(y) A P{x) > P{y) (15) 

A bitstring y is said to be hp-regular at the level m € N whenever 

3x. hp{x)=y A i{x)+m<l{y) A P{x) > 2"^ ■ P{y) (16) 

^The programmability of a real function p can be defined in different ways. The presenta- 
tion going back to constructive analysis is to present P as a program pp : 2* —> 2* which 
for each x outputs a program pp{x) which streams the digits of the real number P{x) € [0,1]. 
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The hp-regular bitstrings at each level form the /ip-regularity sets 

Hm = {y G 2” I hp{x) = y A 

A £{x)m < £{y) A P{x) > 2"^ ■ P{y)] (17) 

The sets Hm = U^i Hff form the hp-test 

Proposition 6.3 The p-size of hp-regularity sets decreases exponentially with 
m 

E ^ 2— (18) 

yeH^ 

Proof. By definition of for every y S there is a: S 2 "“"^ such that 
P{y) < 2~^P{x). It follows that 

E < E 2-™ • P{x) < 2— 


The search for non-random patterns, deviating from a given string distribu¬ 
tion P, proceeds just like the search for patterns deviating from the uniform 
distribution in Sec. 13.21 When £ is the family of all computable functions, there 
is a universal detector again, constructed by extending the definition in the 
proof of Prop. Hm in the obvious way. 

7 Concluding remarks 

The starting point of this paper was the observation that finding and playing 
one’s own strategy is often much easier than recognizing and understanding 
other players’ strategies. In particular, randomizing is much easier than testing 
randomness. On the other hand, knowing that the opponent keeps an eye on 
how you play is necessary for the implementations of many equilibrium concepts, 
usually assumed implicitly. In order to stay at an equilibrium, the players must 
test each other. But capturing their tests opens an alley towards modeling 
competition, outsmarting, and deceit, which are prominent in the practice of 
gaming, but often ignored in game theory. We believe that the tools are readily 
available to tackle this interesting and important aspect of gaming. 

Players’ randomness testing of each other’s plays turned out to be an intu¬ 
itive characterization of the notion of randomness. It is perhaps worth emphasiz¬ 
ing here that the players with different computational powers recognize different 
notions of randomness. More precisely, different families of programmable func¬ 
tions £ induce different detectors, different tests, different notions of random¬ 
ness, and different implementations for the mixed strategy equilibria. Restrict¬ 
ing the family £ to the language of regular expressions or finite state machines 
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would give a weak but interesting notion. The detectors could be implemented 
along the lines of the familiar compression algorithms, such as those due to Lem- 
pel, Ziv and Welch However, since there is no such thing as a universal 

finite state machine, capable of evaluating all finite state machines, such tests 
based on regular languages would have to be specified one at a time, and sought 
ad hoc. In contrast, taking £ to be a Turing complete language, such as the 
language of Turing machines themselves, allows constructing a nniversal ran¬ 
domness test, which Alice could implement as a universal detector from Sec. SI 
This leads to the canonical notion of randomness spelled out by Kolmogorov, 
Martin-L6f and Solovay, and characterized in Corollarv l5.ll Although the simple 
dovetailing technique used to construct the universal detector in Corollary 15.II 
quickly leads beyond the realm of what is nowadays considered feasible com¬ 
putation, the modern methods of algorithmic learning and statistical induction 
are nevertheless built upon them [IIllllSS]. Learning, evolution and gaming 
are computational processes that do not happen just among humans, but also 
in nature. Even very slow computations in sufficiently large populations and at 
sufficiently large time scales produce their results eventually. 
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